According to BBC, streaming online platforms became more popular than ever during the lockdown provoked by the COVID-19 pandemic. For instance, in UK during April 2020, 40% of the waking hours were spent in front of the screens, on average. Because of the growing popularity of online streaming services all over the world and the increasing variety of content available on such platforms, in this report we propose to analyze 4 major platforms: Netflix, Prime Video, Hulu and Disney+ in terms of quality. We specifically concentrate on movies that are available on these platforms in the United States. Depending on the platform, the monthly price for subscription varies: in 2021, Netflix costs - 9$, Prime Video - 9$, Hulu - 7$ and Disney+ 8$. The aim of this report is to understand which platform leads in terms of quality content and suits better potential future users, for example depending on the genre preference or age group. The analysis is performed on the dataset available on Kaggle that originally was scraped on the streaming platforms. Specifically, the dataset contains the list of movies available on the 4 streaming pltaforms in 2021. Overall, the dataset contains 9515 row where each row corresponds to a movie and is characterized by the following variables:
ID, unique ID for each movie,
Title, full title of the movie,
Year, release year of the movie, time interval from 1914 to 2021,
Age, factor variable specifying age group of the audience for which the movie is intended with the following levels: all, 7+, 13+, 16+, 18+,
Rotten Tomatoes, score out of 100 that intends to measure the quality of movies provided by Rotten Tomatoes - an american review-aggregation website for film and television,
IMBd, score out of 10 that intends to measure the quality of movies provided by IMBd which is an online database of information related to films and television series, including ratings.
Netflix, binary variable taking 1 if the movie is available on Netflix streaming platform, 0 - otherwise,
Prime Video, binary variable taking 1 if the movie is available on Prime Video streaming platform, 0 - otherwise,
Hulu, binary variable taking 1 if the movie is available on Hulu streaming platform, 0 - otherwise,
Disney+, binary variable taking 1 if the movie is available on Disney+ streaming platform, 0 - otherwise.
TOM
The main dataset used is composed of the following variables :
Title : the film title;
Year : production year;
Age : age limit for viewing (all, +7, +13, +16, +18);
IMDb : IMDb rate (between 0 and 10);
Rotten.Tomatoes : Rotten Tomatoes rate (between 0 and 100);
Netflix : 1 if the film is on Netflix and 0 otherwise;
Hulu : 1 if the film is on Hulu and 0 otherwise;
Prime.Video : 1 if the film is on Prime Video and 0 otherwise;
Disney. : 1 if the film is on Disney+ and 0 otherwise;
Directors : production directors;
Genres : film category;
Country : production country;
Language : production language;
Runtime : total duration.